Picture for Junjie Wang

Junjie Wang

Department of Radiation Oncology, Peking University Third Hospital, Beijing, China

Decoupled Residual Quantization for Robust Semantic IDs in Recommendation

Add code
Jun 01, 2026
Viaarxiv icon

Internalize the Temperature: On-Policy Self-Distillation as Policy Reheater for Reinforcement Learning

Add code
May 30, 2026
Viaarxiv icon

Smaller Models are Natural Explorers for Policy-Level Diversity in GRPO

Add code
May 29, 2026
Viaarxiv icon

Benchmarking and Evolving Reason-Reflect-Rectify for Reflective Visual Generation

Add code
May 19, 2026
Viaarxiv icon

Where Did It Go Wrong? Capability-Oriented Failure Attribution for Vision-and-Language Navigation Agents

Add code
Apr 28, 2026
Viaarxiv icon

From Procedural Skills to Strategy Genes: Towards Experience-Driven Test-Time Evolution

Add code
Apr 16, 2026
Viaarxiv icon

AutoEG: Exploiting Known Third-Party Vulnerabilities in Black-Box Web Applications

Add code
Apr 01, 2026
Viaarxiv icon

Mitigating the Reasoning Tax in Vision-Language Fine-Tuning with Input-Adaptive Depth Aggregation

Add code
Mar 27, 2026
Viaarxiv icon

Linear-Nonlinear Fusion Neural Operator for Partial Differential Equations

Add code
Mar 25, 2026
Viaarxiv icon

Efficient Reasoning with Balanced Thinking

Add code
Mar 19, 2026
Viaarxiv icon